Sandman Tales v2
AI-Powered Multilingual Bedtime Stories
Team ClawCutters, Mistral Worldwide Hackathon 2026
Technical 20%
Creativity 20%
Usefulness 20%
Demo 20%
Track Alignment 20%
Usefulness
The Quiet Crisis
27.6%
of Australians were born overseas (ABS Census)
Millions of kids grow up hearing a second language at home.
But that language fades as English takes over at school.
"I don't want Sophie to forget French. It's who we are." (Sophie's mum)
Creativity
Sandman Tales
Bedtime stories that keep heritage languages alive
Parent speaks in English
→ AI writes story in child's heritage language
→ Child listens to personalised bedtime tale
10 languages · Personalised with child's name · Watercolour illustrations · Three audio layers
Creativity
Meet the Team
Six AI agents inspired by cultural mythology

Papa Bois: Orchestrator
Anansi: Storyteller
Ogma: Language Guardian
Devi: Voice & Audio
Firefly: Builder
Jerry: Concierge
Technical
System Architecture
%%{init: {"theme": "dark", "themeVariables": {"primaryColor": "#a78bfa", "primaryTextColor": "#e2e8f0", "primaryBorderColor": "#818cf8", "lineColor": "#f59e0b", "secondaryColor": "#1e1b4b", "tertiaryColor": "#0f172a", "fontSize": "14px"}}}%%
graph TB
subgraph CLIENT["Frontend: React + Vite"]
UI["Story Creator<br/>Story Player<br/>Story Library<br/>Voice Picker"]
end
subgraph BACKEND["Backend: FastAPI on Railway"]
API["REST API + WebSocket"]
CACHE["Prompt Cache<br/>SHA-256 Dedup"]
ORCH["Papa Bois<br/>Orchestrator"]
end
subgraph MISTRAL["Mistral AI"]
AGENTS_API["Agents API<br/>Conversations + Handoffs"]
LARGE["Mistral Large 3<br/>Story Generation"]
VOX["Voxtral<br/>Speech-to-Text"]
end
subgraph ELEVEN["ElevenLabs"]
STT["Scribe v1: STT"]
TTS["Multilingual v2: TTS"]
STREAM["WebSocket Streaming"]
SFX["Sound Effects"]
MUSIC["Music Compose"]
VOICES["Voices API"]
EA["ElevenAgents"]
end
subgraph STORAGE["Storage"]
TURSO["Turso<br/>SQLite Cloud<br/>Tokyo Region"]
IMAGEN["Google Gemini Image Generation<br/>Illustrations"]
end
UI -->|HTTPS| API
API --> CACHE
CACHE -->|cache miss| ORCH
ORCH -->|delegates| AGENTS_API
AGENTS_API --> LARGE
ORCH -->|dual STT| VOX
ORCH -->|dual STT| STT
ORCH -->|narrate| TTS
ORCH -->|stream| STREAM
ORCH -->|ambient| SFX
ORCH -->|lullaby| MUSIC
ORCH -->|browse| VOICES
ORCH -->|chat| EA
ORCH -->|illustrate| IMAGEN
API -->|persist| TURSO
CACHE -->|dedup| TURSO
style CLIENT fill:#1e1b4b,stroke:#818cf8,color:#e2e8f0
style BACKEND fill:#1e1b4b,stroke:#f59e0b,color:#e2e8f0
style MISTRAL fill:#2d1f5e,stroke:#a78bfa,color:#e2e8f0
style ELEVEN fill:#1b2e1b,stroke:#34d399,color:#e2e8f0
style STORAGE fill:#2e1f1b,stroke:#f59e0b,color:#e2e8f0
Technical
Agent Pipeline: Story Creation Flow
%%{init: {"theme": "dark", "themeVariables": {"primaryColor": "#a78bfa", "primaryTextColor": "#e2e8f0", "lineColor": "#f59e0b", "fontSize": "13px"}}}%%
sequenceDiagram
participant P as Parent
participant O as Ogma<br/>Language Guardian
participant C as Cache
participant PB as Papa Bois<br/>Orchestrator
participant G as Guardrail
participant A as Anansi<br/>Storyteller
participant D as Devi<br/>Voice/Audio
participant F as Firefly<br/>Builder
P->>O: Voice input (mic)
activate O
O->>O: ElevenLabs STT (Scribe v1)
O->>O: Mistral Voxtral STT
O->>O: Cross-validate + detect language
O-->>C: Transcribed text + language
deactivate O
C->>C: SHA-256 hash check
alt Cache Hit
C-->>P: Return cached story
else Cache Miss
C->>PB: Forward to orchestrator
activate PB
PB->>G: Content safety check
G-->>PB: ✅ Safe
PB->>A: Delegate via Agents API
activate A
A->>A: Mistral Large 3 generates story
A->>A: Split into 6 scenes
A-->>PB: Story JSON (6 scenes)
deactivate A
par Audio Generation
PB->>D: Generate narration (TTS)
PB->>D: Generate SFX per scene
PB->>D: Compose lullaby
and Illustration
PB->>F: Generate 6 scene images
F->>F: Gemini Image Generation watercolours
end
D-->>PB: Audio (narration + SFX + lullaby)
F-->>PB: 6 illustrations
PB-->>C: Store in Turso
deactivate PB
C-->>P: Complete story with media
end
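The "par" block in the flow above runs audio and illustration generation side by side. A minimal sketch of that fan-out with asyncio follows. The function names `generate_audio`, `generate_illustrations` and `build_media` are hypothetical stand-ins for the Devi and Firefly steps, and the bodies are placeholders rather than real ElevenLabs or Gemini calls.

```python
import asyncio

SCENE_COUNT = 6  # the deck states that each story has six scenes


async def generate_audio(scenes: list[str]) -> dict:
    """Hypothetical stand-in for Devi: narration, SFX and lullaby."""
    await asyncio.sleep(0)  # placeholder for real network calls
    return {
        "narration": [f"narration-{i}" for i in range(len(scenes))],
        "sfx": [f"sfx-{i}" for i in range(len(scenes))],
        "lullaby": "lullaby-track",
    }


async def generate_illustrations(scenes: list[str]) -> list[str]:
    """Hypothetical stand-in for Firefly: one image per scene."""
    await asyncio.sleep(0)  # placeholder for real network calls
    return [f"image-{i}" for i in range(len(scenes))]


async def build_media(scenes: list[str]) -> dict:
    # Run both branches concurrently, mirroring the "par" block.
    audio, images = await asyncio.gather(
        generate_audio(scenes), generate_illustrations(scenes)
    )
    return {"audio": audio, "images": images}


if __name__ == "__main__":
    scenes = [f"scene {i}" for i in range(SCENE_COUNT)]
    media = asyncio.run(build_media(scenes))
    print(len(media["images"]), "illustrations ready")
```

With `asyncio.gather`, the orchestrator waits for the slower of the two branches instead of the sum of both, which is the point of the parallel step.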
Technical · Track Alignment
Mistral Agents API: Orchestration Detail
%%{init: {"theme": "dark", "themeVariables": {"primaryColor": "#f59e0b", "primaryTextColor": "#e2e8f0", "lineColor": "#a78bfa", "fontSize": "13px"}}}%%
graph TB
subgraph PAPA["Papa Bois: Orchestrator Agent"]
direction TB
CONV["Conversations API<br/>Multi-turn context"]
HAND["Handoffs<br/>Agent delegation"]
TOOLS["Function Calling<br/>4 function tools bound"]
end
subgraph ANANSI["Anansi: Storyteller Agent"]
direction TB
ML3["Mistral Large 3"]
JSON["JSON Mode<br/>Structured scene output"]
CULT["Cultural sensitivity<br/>Language-aware prompts"]
end
subgraph DEVI_TOOLS["Devi: Bound Function Tools"]
TTS_F["generate_tts()"]
SFX_F["generate_sound_effect()"]
LUL_F["compose_lullaby()"]
SEARCH["search_cultural_context()<br/>via Tavily"]
end
subgraph OGMA_DUAL["Ogma: Dual STT"]
EL_STT["ElevenLabs Scribe v1"]
VOX_STT["Mistral Voxtral"]
CROSS["Cross-validation<br/>Best-of-two"]
end
PAPA -->|handoff| ANANSI
PAPA -->|function call| DEVI_TOOLS
OGMA_DUAL -->|transcription| PAPA
style PAPA fill:#44337a,stroke:#f59e0b,color:#e2e8f0
style ANANSI fill:#44337a,stroke:#a78bfa,color:#e2e8f0
style DEVI_TOOLS fill:#1b3a2a,stroke:#34d399,color:#e2e8f0
style OGMA_DUAL fill:#1b2e4a,stroke:#818cf8,color:#e2e8f0
4 pre-registered Mistral Agents · Conversations for multi-turn · Handoffs for delegation · Function calling for ElevenLabs and Tavily tools
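The four bound tools named in the diagram can be routed through a small dispatch table. This is only a sketch of that routing: the tool names come from the diagram, but the parameter lists, the stub return values and the `dispatch_tool_call` helper are assumptions, and the payload shape (a tool name plus a JSON string of arguments) is a generic function-calling convention rather than a quotation of the Mistral SDK.

```python
import json
from typing import Callable, Dict


# Stub implementations: parameters and return shapes are hypothetical.
def generate_tts(text: str, language: str) -> dict:
    return {"tool": "generate_tts", "text": text, "language": language}


def generate_sound_effect(description: str) -> dict:
    return {"tool": "generate_sound_effect", "description": description}


def compose_lullaby(mood: str) -> dict:
    return {"tool": "compose_lullaby", "mood": mood}


def search_cultural_context(query: str) -> dict:
    return {"tool": "search_cultural_context", "query": query}


# Tool names as listed in the orchestration diagram.
TOOL_REGISTRY: Dict[str, Callable[..., dict]] = {
    "generate_tts": generate_tts,
    "generate_sound_effect": generate_sound_effect,
    "compose_lullaby": compose_lullaby,
    "search_cultural_context": search_cultural_context,
}


def dispatch_tool_call(name: str, arguments_json: str) -> dict:
    """Look up a tool by name and call it with JSON-decoded keyword arguments."""
    if name not in TOOL_REGISTRY:
        raise KeyError(f"Unknown tool: {name}")
    kwargs = json.loads(arguments_json)
    return TOOL_REGISTRY[name](**kwargs)


if __name__ == "__main__":
    call = dispatch_tool_call(
        "generate_tts", '{"text": "Bonne nuit, Sophie", "language": "fr"}'
    )
    print(call["tool"], call["language"])
```

Keeping the registry in one place means an unknown tool name fails loudly instead of being silently ignored.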
Track Alignment
ElevenLabs: All 7 Tools in Architecture
%%{init: {"theme": "dark", "themeVariables": {"primaryColor": "#34d399", "primaryTextColor": "#e2e8f0", "lineColor": "#f59e0b", "fontSize": "13px"}}}%%
graph LR
subgraph INPUT["Input Layer"]
MIC["Parent Mic"]
CHAT["Text Chat"]
end
subgraph STT_LAYER["① ② Speech-to-Text"]
S1["① ElevenAgents<br/>Jerry Concierge<br/>Conversational"]
S2["② Scribe v1<br/>STT Transcription"]
end
subgraph VOICE_LAYER["③ Voice Selection"]
S3["③ Voices/Get API<br/>Browse & Preview"]
end
subgraph AUDIO_LAYER["④ ⑤ ⑥ ⑦ Audio Generation"]
S4["④ TTS Batch<br/>Multilingual v2<br/>10 languages"]
S5["⑤ TTS WebSocket<br/>Real-time Streaming"]
S6["⑥ Sound Effects<br/>Contextual SFX"]
S7["⑦ Music Compose<br/>Lullaby Generation"]
end
subgraph OUTPUT["3-Layer Audio"]
NAR["Narration"]
AMB["Ambience"]
MUS["Music Bed"]
end
MIC --> S2
CHAT --> S1
S1 --> S3
S2 --> S3
S3 --> S4
S3 --> S5
S4 --> NAR
S5 --> NAR
S6 --> AMB
S7 --> MUS
style INPUT fill:#1e1b4b,stroke:#818cf8,color:#e2e8f0
style STT_LAYER fill:#1b3a2a,stroke:#34d399,color:#e2e8f0
style VOICE_LAYER fill:#1b3a2a,stroke:#34d399,color:#e2e8f0
style AUDIO_LAYER fill:#1b3a2a,stroke:#34d399,color:#e2e8f0
style OUTPUT fill:#2e1f1b,stroke:#f59e0b,color:#e2e8f0
Demo
Live App Demo
Login
Starfield animation. Pre-seeded test accounts in Turso (SQLite cloud, Tokyo region).
PBKDF2-SHA256 password hashing.
Live: sandmantales-api-production.up.railway.app
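For reference, PBKDF2-SHA256 hashing as named on this slide can be sketched with Python's standard library. The iteration count, salt length and the helper names `hash_password` and `verify_password` are assumptions for illustration; the deck does not state the parameters used in the app.

```python
import hashlib
import hmac
import os
from typing import Optional, Tuple

ITERATIONS = 200_000  # assumption: the deck does not give the real count
SALT_BYTES = 16       # assumption: a common salt length


def hash_password(password: str, salt: Optional[bytes] = None) -> Tuple[bytes, bytes]:
    """Return (salt, digest) using PBKDF2-HMAC-SHA256."""
    if salt is None:
        salt = os.urandom(SALT_BYTES)  # fresh random salt per password
    digest = hashlib.pbkdf2_hmac(
        "sha256", password.encode("utf-8"), salt, ITERATIONS
    )
    return salt, digest


def verify_password(password: str, salt: bytes, expected: bytes) -> bool:
    """Recompute the digest with the stored salt and compare in constant time."""
    _, digest = hash_password(password, salt)
    return hmac.compare_digest(digest, expected)


if __name__ == "__main__":
    salt, digest = hash_password("demo-password")
    print(verify_password("demo-password", salt, digest))
```

Storing the salt next to the digest is what lets the login check recompute the same hash later.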
Demo
Create a Story
Sophie's Story
Mic button → Ogma dual-STT (ElevenLabs + Mistral Voxtral)
10 languages in dropdown
Child's name woven into every scene
Parent describes the day in English → story generated in French
Technical · Demo
Pipeline in Action
9 Agents, Real-Time
Ogma transcribes: ✅
Cache checks SHA-256: ✅
Papa Bois orchestrates via Mistral Agents API: ✅
Guardrail validates: ✅
Anansi writes story with Mistral Large: running...
Devi (TTS + SFX + Lullaby): pending
Firefly (Illustrations): pending
Technical
Pipeline Complete
All Green ✅
Every node complete. Story, illustrations, and three audio layers generated.
Cache layer: repeat prompts are served from cache at zero generation cost. SHA-256 dedup on every story and audio clip in Turso.
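A minimal sketch of SHA-256 prompt dedup as described above. The key fields (prompt, language, child name), the whitespace and case normalisation, and the `StoryCache` class are assumptions for illustration; an in-memory dict stands in for the Turso table.

```python
import hashlib
import json
from typing import Callable, Dict


def cache_key(prompt: str, language: str, child_name: str) -> str:
    """Hash a canonical JSON form of the request with SHA-256."""
    payload = json.dumps(
        {
            # Assumption: collapse whitespace and case so trivial edits still hit.
            "prompt": " ".join(prompt.lower().split()),
            "language": language,
            "child_name": child_name,
        },
        sort_keys=True,
        ensure_ascii=False,
    )
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


class StoryCache:
    """In-memory stand-in for the Turso-backed cache."""

    def __init__(self) -> None:
        self._store: Dict[str, dict] = {}
        self.misses = 0

    def get_or_create(
        self, prompt: str, language: str, child_name: str,
        generate: Callable[[], dict],
    ) -> dict:
        key = cache_key(prompt, language, child_name)
        if key not in self._store:
            self.misses += 1  # only a miss triggers generation
            self._store[key] = generate()
        return self._store[key]


if __name__ == "__main__":
    cache = StoryCache()
    make = lambda: {"title": "Sophie et la Baleine de Nuages"}
    cache.get_or_create("We went to the beach", "fr", "Sophie", make)
    cache.get_or_create("we went  to the beach", "fr", "Sophie", make)
    print("generation calls:", cache.misses)
```

How aggressively to normalise the prompt is a design choice: stricter normalisation raises the hit rate but risks returning a story for a prompt that differs in a way the parent cared about.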
Demo · Creativity
Sophie's Story
🇫🇷 Sophie et la Baleine de Nuages
Six scenes in French
Watercolour illustrations (Gemini Image Generation)
Three audio layers:
- Narration: ElevenLabs Multilingual v2
- Sound effects: ElevenLabs SFX
- Lullaby: ElevenLabs Music Compose
10 languages supported
Demo
Story Library
18 Stories, 10 Languages
🇬🇧 🇫🇷 🇯🇵 🇮🇳 🇪🇸 🇵🇹 🇩🇪 🇨🇳 🇸🇦 🇰🇷
Every story personalised with the child's name and world.
Every night Sophie listens, the stories grow with her. That's not a feature; it's a relationship.
Track Alignment
All 7 ElevenLabs Tools
1. ElevenAgents
Jerry: conversational story concierge that helps parents pick voices and themes
2. Speech-to-Text
Ogma: dual STT alongside Voxtral for cross-validation
3. Voices/Get
Voice browser: parents preview narration voices before generating
4. Text-to-Speech
Devi: batch narration across 6 scenes, 10 languages
5. TTS WebSocket
Real-time streaming for scene-by-scene playback
6. Sound Effects
Contextual SFX per scene: ocean waves, birdsong, rain
7. Music Compose
Lullaby generation: gentle music bed under narration
Track Alignment
Mistral Integration
Mistral Agents API
Papa Bois orchestrates agent delegation via Conversations + Handoffs
Mistral Large 3
Anansi: story generation, scene splitting, cultural sensitivity
Voxtral
Ogma: speech-to-text alongside ElevenLabs for dual transcription
Vibe CLI
Firefly built the frontend components; Papa Bois planned the architecture
%%{init: {"theme": "dark", "themeVariables": {"primaryColor": "#f59e0b", "primaryTextColor": "#e2e8f0", "lineColor": "#a78bfa"}}}%%
graph LR
V["Voxtral STT"] --> PB["Papa Bois<br/>Agents API"]
PB --> AN["Anansi<br/>Mistral Large"]
PB --> VB["Vibe CLI<br/>Firefly"]
style PB fill:#44337a,stroke:#f59e0b
style AN fill:#44337a,stroke:#a78bfa
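The dual transcription step (Voxtral alongside ElevenLabs, cross-validated best-of-two) could be sketched as below. The deck does not say how the two transcripts are compared, so this sketch makes its own assumptions: it treats each engine as returning a text plus a confidence score, measures word-level agreement with `difflib`, and uses an arbitrary 0.8 threshold. The `Transcript` type and both helper names are hypothetical.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher
from typing import Tuple


@dataclass
class Transcript:
    engine: str
    text: str
    confidence: float  # hypothetical score in the range 0..1


def agreement(a: str, b: str) -> float:
    """Word-level similarity between two transcripts, ignoring case."""
    return SequenceMatcher(None, a.lower().split(), b.lower().split()).ratio()


def best_of_two(
    first: Transcript, second: Transcript, min_agreement: float = 0.8
) -> Tuple[Transcript, bool]:
    """Pick the more confident transcript and report whether the engines agree."""
    agreed = agreement(first.text, second.text) >= min_agreement
    best = first if first.confidence >= second.confidence else second
    return best, agreed


if __name__ == "__main__":
    scribe = Transcript("scribe_v1", "Sophie saw a whale in the clouds", 0.91)
    voxtral = Transcript("voxtral", "sophie saw a whale in the clouds", 0.94)
    chosen, agreed = best_of_two(scribe, voxtral)
    print(chosen.engine, agreed)
```

When the two engines disagree, the `agreed` flag gives the orchestrator a hook to ask the parent to repeat the prompt rather than generating a story from a doubtful transcript.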
Technical
Engineering Maturity
LoRA Fine-Tuning: Trained Stable Diffusion on 20 storybook images generated with Gemini Image Generation. 500 steps, 15 minutes, 3.1 MB weights.
LLM-as-Judge: Pixtral evaluated base vs LoRA side-by-side.
Base SD 1.5: 6.8/10 → LoRA: 9.0/10 → Gemini Image Generation: 9.5/10
Decision: We shipped Gemini Image Generation. Knowing when NOT to use your own model is the innovation.
Prompt Cache: SHA-256 dedup. Zero-cost repeat requests. All stored in Turso cloud DB.
✅ 33/33 smoke tests passing: auth, CRUD, cached audio, story gen, all ElevenLabs endpoints, Mistral agents
Usefulness
Market Opportunity
50M
multilingual households
worldwide
Every night Sophie listens, the stories grow with her.
Personalisation data builds a relationship competitors can't copy overnight.
Team ClawCutters
Six mythology-inspired AI agents · Seven ElevenLabs tools · Mistral Agents API
LoRA fine-tuning · Pixtral judge · Turso cloud · Built in 48 hours
Bonne nuit, Sophie
sandmantales-api-production.up.railway.app